Investigating the Use of Chronological Splitting to Compare Software Cross-company and Single-company Effort Predictions: A Replicated Study

Authors

  • Emilia Mendes
  • Christopher J. Lokan
Abstract

CONTEXT: Three previous studies have investigated the use of a chronological split to compare cross-company to single-company effort predictions, and all three used release 10 of the ISBSG dataset. These studies therefore need to be replicated using different datasets, so that the patterns previously observed can be compared and contrasted and a better understanding of chronological splitting can be reached. OBJECTIVE: The aim of this study is to replicate [17] using the same chronological splitting but a different database, the Finnish dataset. METHOD: Chronological splitting was compared with two forms of cross-validation. The chronological splitting used was the project-by-project chronological split, in which a validation set contains a single project and a regression model is built from scratch, using as its training set the projects completed before the validation project's start date. We used 201 single-company projects and 593 cross-company projects from the Finnish dataset. RESULTS: Single-company models gave significantly better predictions than cross-company models. Chronological splitting gave significantly worse accuracy than leave-one-out and leave-two-out cross-validation when based on single-company data, and similar accuracy when based on cross-company data. CONCLUSIONS: Results did not seem promising when using project-by-project splitting; however, in a real scenario, companies that use their own data can only apply some form of chronological splitting when obtaining effort estimates for their new projects. We therefore urge the use of chronological splitting in effort estimation studies, so that more realistic results can be provided to inform industry.
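The project-by-project chronological split described under METHOD can be illustrated with a minimal sketch. The abstract does not state which predictors or regression form the study used, so the one-predictor least-squares model, the `Project` fields, and the `min_training` threshold below are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Project:
    # Hypothetical record layout; not taken from the Finnish dataset.
    name: str
    start: date
    end: date
    size: float    # assumed size measure, e.g. function points
    effort: float  # actual effort, e.g. person-hours


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        # All training sizes identical: fall back to the mean effort.
        return mean_y, 0.0
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    return mean_y - b * mean_x, b


def chronological_predictions(projects, min_training=2):
    """Predict each project from a model rebuilt from scratch on the
    projects that were completed before that project's start date."""
    results = []
    for target in projects:
        training = [p for p in projects if p.end < target.start]
        if len(training) < min_training:
            continue  # assumed rule: skip projects with too little history
        a, b = fit_line([p.size for p in training],
                        [p.effort for p in training])
        results.append((target.name, a + b * target.size))
    return results
```

Unlike leave-one-out cross-validation, which trains on every other project regardless of date, the training set here grows over time and never contains a project that was still unfinished when the target project started.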

Similar resources

Investigating the Use of Chronological Splitting to Compare Software Cross-company and Single-company Effort Predictions

CONTEXT: Numerous studies have investigated the use of cross-company datasets to estimate effort for single-company projects; however to date only one has compared the effect of using a chronological split instead of a random split to assign projects to a training set and a validation set, finding no significant differences. OBJECTIVE: The aim of this study is to extend [15] using a project-by-...

Using Chronological Splitting to Compare Cross- and Single-company Effort Models: Further Investigation

Numerous studies have used historical datasets to build and validate models for estimating software development effort. Very few used a chronological split (where projects’ end dates are used so that training sets only contain projects that were completed before the start date of each project in the validation set), and only one compared chronological split to random split. Therefore the aim of...

Publication date: 2009